Overview

Dataset Statistics

Number of Variables 26
Number of Rows 2216
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 182
Duplicate Rows (%) 8.2%
Total Size in Memory 838.8 KB
Average Row Size in Memory 387.6 B
Variable Types
  • Numerical: 14
  • Categorical: 12

Dataset Insights

mntfruits and mntsweetproducts have similar distributions Similar Distribution
income is skewed Skewed
mntwines is skewed Skewed
mntfruits is skewed Skewed
mntmeatproducts is skewed Skewed
mntfishproducts is skewed Skewed
mntsweetproducts is skewed Skewed
mntgoldprods is skewed Skewed
numdealspurchases is skewed Skewed
numwebpurchases is skewed Skewed
numcatalogpurchases is skewed Skewed
numstorepurchases is skewed Skewed
numwebvisitsmonth is skewed Skewed
Dataset has 182 (8.21%) duplicate rows Duplicates
dt_customer has a high cardinality: 662 distinct values High Cardinality
kidhome has constant length 1 Constant Length
teenhome has constant length 1 Constant Length
dt_customer has constant length 10 Constant Length
acceptedcmp3 has constant length 1 Constant Length
acceptedcmp4 has constant length 1 Constant Length
acceptedcmp5 has constant length 1 Constant Length
acceptedcmp1 has constant length 1 Constant Length
acceptedcmp2 has constant length 1 Constant Length
complain has constant length 1 Constant Length
response has constant length 1 Constant Length
mntfruits has 395 (17.82%) zeros Zeros
mntfishproducts has 379 (17.1%) zeros Zeros
mntsweetproducts has 413 (18.64%) zeros Zeros
numcatalogpurchases has 576 (25.99%) zeros Zeros
  • 1
  • 2
  • 3

Variables

year_birth

numerical

Approximate Distinct Count 59
Approximate Unique (%) 2.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 1968.8204
Minimum 1893
Maximum 1996
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • year_birth is skewed left (γ1 = -0.3534)

Quantile Statistics

Minimum 1893
5-th Percentile 1950
Q1 1959
Median 1970
Q3 1977
95-th Percentile 1988
Maximum 1996
Range 103
IQR 18

Descriptive Statistics

Mean 1968.8204
Standard Deviation 11.9856
Variance 143.6535
Sum 4.3629e+06
Skewness -0.3534
Kurtosis 0.7303
Coefficient of Variation 0.006088
  • year_birth has 3 outliers

education

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 156.9 KB
  • The largest value (Graduation) is over 2.32 times larger than the second largest value (PhD)

Length

Mean 7.5194
Standard Deviation 2.8446
Median 10
Minimum 3
Maximum 10

Sample

1st row Graduation
2nd row Graduation
3rd row Graduation
4th row Graduation
5th row PhD

Letter

Count 16263
Lowercase Letter 13566
Space Separator 200
Uppercase Letter 2697
Dash Punctuation 0
Decimal Number 200
  • The top 2 categories (Graduation, PhD) take over 50.0%
  • The largest value (graduation) is over 2.32 times larger than the second largest value (phd)

marital_status

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Memory Size 156.0 KB

Length

Mean 7.0758
Standard Deviation 0.8497
Median 7
Minimum 4
Maximum 8

Sample

1st row Single
2nd row Single
3rd row Together
4th row Together
5th row Married

Letter

Count 15680
Lowercase Letter 13458
Space Separator 0
Uppercase Letter 2222
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Married, Together) take over 50.0%

income

numerical

Approximate Distinct Count 1974
Approximate Unique (%) 89.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 52247.2514
Minimum 1730
Maximum 666666
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • income is skewed right (γ1 = 6.7589)

Quantile Statistics

Minimum 1730
5-th Percentile 18985.5
Q1 35303
Median 51381.5
Q3 68522
95-th Percentile 84130
Maximum 666666
Range 664936
IQR 33219

Descriptive Statistics

Mean 52247.2514
Standard Deviation 25173.0767
Variance 6.3368e+08
Sum 1.1578e+08
Skewness 6.7589
Kurtosis 159.274
Coefficient of Variation 0.4818
  • income is not normally distributed (p-value 2.634936839657269e-10)
  • income has 8 outliers

kidhome

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 142.8 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 1
3rd row 0
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2216
  • The top 2 categories (0, 1) take over 50.0%
  • kidhome has words of constant length

teenhome

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 142.8 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 1
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2216
  • The top 2 categories (0, 1) take over 50.0%
  • teenhome has words of constant length

dt_customer

categorical

Approximate Distinct Count 662
Approximate Unique (%) 29.9%
Missing 0
Missing (%) 0.0%
Memory Size 162.3 KB

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 04-09-2012
2nd row 08-03-2014
3rd row 21-08-2013
4th row 10-02-2014
5th row 19-01-2014

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 4432
Decimal Number 17728
  • dt_customer has words of constant length

recency

numerical

Approximate Distinct Count 100
Approximate Unique (%) 4.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 49.0126
Minimum 0
Maximum 99
Zeros 28
Zeros (%) 1.3%
Negatives 0
Negatives (%) 0.0%
  • recency is skewed right (γ1 = 0.0016)

Quantile Statistics

Minimum 0
5-th Percentile 4
Q1 24
Median 49
Q3 74
95-th Percentile 94
Maximum 99
Range 99
IQR 50

Descriptive Statistics

Mean 49.0126
Standard Deviation 28.9484
Variance 838.0071
Sum 108612
Skewness 0.001647
Kurtosis -1.1998
Coefficient of Variation 0.5906

mntwines

numerical

Approximate Distinct Count 776
Approximate Unique (%) 35.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 305.0916
Minimum 0
Maximum 1493
Zeros 13
Zeros (%) 0.6%
Negatives 0
Negatives (%) 0.0%
  • mntwines is skewed right (γ1 = 1.1699)

Quantile Statistics

Minimum 0
5-th Percentile 3
Q1 24
Median 174.5
Q3 505
95-th Percentile 1000.25
Maximum 1493
Range 1493
IQR 481

Descriptive Statistics

Mean 305.0916
Standard Deviation 337.3279
Variance 113790.1257
Sum 676083
Skewness 1.1699
Kurtosis 0.5787
Coefficient of Variation 1.1057
  • mntwines is not normally distributed (p-value 4.008000246793743e-22)
  • mntwines has 35 outliers

mntfruits

numerical

Approximate Distinct Count 158
Approximate Unique (%) 7.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 26.356
Minimum 0
Maximum 199
Zeros 395
Zeros (%) 17.8%
Negatives 0
Negatives (%) 0.0%
  • mntfruits is skewed right (γ1 = 2.1002)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 2
Median 8
Q3 33
95-th Percentile 122.25
Maximum 199
Range 199
IQR 31

Descriptive Statistics

Mean 26.356
Standard Deviation 39.7939
Variance 1583.5558
Sum 58405
Skewness 2.1002
Kurtosis 4.0422
Coefficient of Variation 1.5099
  • mntfruits is not normally distributed (p-value 3.181715182890454e-21)
  • mntfruits has 246 outliers

mntmeatproducts

numerical

Approximate Distinct Count 554
Approximate Unique (%) 25.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 166.9959
Minimum 0
Maximum 1725
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • mntmeatproducts is skewed right (γ1 = 2.0242)

Quantile Statistics

Minimum 0
5-th Percentile 4
Q1 16
Median 68
Q3 232.25
95-th Percentile 687.5
Maximum 1725
Range 1725
IQR 216.25

Descriptive Statistics

Mean 166.9959
Standard Deviation 224.2833
Variance 50302.9864
Sum 370063
Skewness 2.0242
Kurtosis 5.0414
Coefficient of Variation 1.343
  • mntmeatproducts is not normally distributed (p-value 6.546299470778271e-22)
  • mntmeatproducts has 174 outliers

mntfishproducts

numerical

Approximate Distinct Count 182
Approximate Unique (%) 8.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 37.6376
Minimum 0
Maximum 259
Zeros 379
Zeros (%) 17.1%
Negatives 0
Negatives (%) 0.0%
  • mntfishproducts is skewed right (γ1 = 1.9151)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 3
Median 12
Q3 50
95-th Percentile 169
Maximum 259
Range 259
IQR 47

Descriptive Statistics

Mean 37.6376
Standard Deviation 54.7521
Variance 2997.7905
Sum 83405
Skewness 1.9151
Kurtosis 3.0668
Coefficient of Variation 1.4547
  • mntfishproducts is not normally distributed (p-value 1.5951380466361204e-21)
  • mntfishproducts has 222 outliers

mntsweetproducts

numerical

Approximate Distinct Count 176
Approximate Unique (%) 7.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 27.0289
Minimum 0
Maximum 262
Zeros 413
Zeros (%) 18.6%
Negatives 0
Negatives (%) 0.0%
  • mntsweetproducts is skewed right (γ1 = 2.1019)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 1
Median 8
Q3 33
95-th Percentile 125.25
Maximum 262
Range 262
IQR 32

Descriptive Statistics

Mean 27.0289
Standard Deviation 41.072
Variance 1686.9129
Sum 59896
Skewness 2.1019
Kurtosis 4.0942
Coefficient of Variation 1.5196
  • mntsweetproducts is not normally distributed (p-value 1.4074646768059784e-22)
  • mntsweetproducts has 246 outliers

mntgoldprods

numerical

Approximate Distinct Count 212
Approximate Unique (%) 9.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 43.9653
Minimum 0
Maximum 321
Zeros 61
Zeros (%) 2.8%
Negatives 0
Negatives (%) 0.0%
  • mntgoldprods is skewed right (γ1 = 1.838)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 9
Median 24.5
Q3 56
95-th Percentile 165.25
Maximum 321
Range 321
IQR 47

Descriptive Statistics

Mean 43.9653
Standard Deviation 51.8154
Variance 2684.8372
Sum 97427
Skewness 1.838
Kurtosis 3.1465
Coefficient of Variation 1.1786
  • mntgoldprods is not normally distributed (p-value 3.2757664011903303e-13)
  • mntgoldprods has 205 outliers

numdealspurchases

numerical

Approximate Distinct Count 15
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 2.3236
Minimum 0
Maximum 15
Zeros 44
Zeros (%) 2.0%
Negatives 0
Negatives (%) 0.0%
  • numdealspurchases is skewed right (γ1 = 2.4136)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 1
Median 2
Q3 3
95-th Percentile 6
Maximum 15
Range 15
IQR 2

Descriptive Statistics

Mean 2.3236
Standard Deviation 1.9237
Variance 3.7007
Sum 5149
Skewness 2.4136
Kurtosis 8.9515
Coefficient of Variation 0.8279
  • numdealspurchases is not normally distributed (p-value 4.736020049194525e-19)
  • numdealspurchases has 84 outliers

numwebpurchases

numerical

Approximate Distinct Count 15
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 4.0853
Minimum 0
Maximum 27
Zeros 48
Zeros (%) 2.2%
Negatives 0
Negatives (%) 0.0%
  • numwebpurchases is skewed right (γ1 = 1.1962)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 2
Median 4
Q3 6
95-th Percentile 9
Maximum 27
Range 27
IQR 4

Descriptive Statistics

Mean 4.0853
Standard Deviation 2.741
Variance 7.5128
Sum 9053
Skewness 1.1962
Kurtosis 4.0602
Coefficient of Variation 0.6709
  • numwebpurchases is not normally distributed (p-value 2.063614198663306e-08)
  • numwebpurchases has 3 outliers

numcatalogpurchases

numerical

Approximate Distinct Count 14
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 2.671
Minimum 0
Maximum 28
Zeros 576
Zeros (%) 26.0%
Negatives 0
Negatives (%) 0.0%
  • numcatalogpurchases is skewed right (γ1 = 1.8798)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 2
Q3 4
95-th Percentile 9
Maximum 28
Range 28
IQR 4

Descriptive Statistics

Mean 2.671
Standard Deviation 2.9267
Variance 8.5658
Sum 5919
Skewness 1.8798
Kurtosis 8.0462
Coefficient of Variation 1.0957
  • numcatalogpurchases is not normally distributed (p-value 7.078332938838988e-14)
  • numcatalogpurchases has 23 outliers

numstorepurchases

numerical

Approximate Distinct Count 14
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 5.801
Minimum 0
Maximum 13
Zeros 14
Zeros (%) 0.6%
Negatives 0
Negatives (%) 0.0%
  • numstorepurchases is skewed right (γ1 = 0.7014)

Quantile Statistics

Minimum 0
5-th Percentile 2
Q1 3
Median 5
Q3 8
95-th Percentile 12
Maximum 13
Range 13
IQR 5

Descriptive Statistics

Mean 5.801
Standard Deviation 3.2508
Variance 10.5676
Sum 12855
Skewness 0.7014
Kurtosis -0.6278
Coefficient of Variation 0.5604
  • numstorepurchases is not normally distributed (p-value 1.8453706679502467e-11)

numwebvisitsmonth

numerical

Approximate Distinct Count 16
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 34.6 KB
Mean 5.319
Minimum 0
Maximum 20
Zeros 10
Zeros (%) 0.5%
Negatives 0
Negatives (%) 0.0%
  • numwebvisitsmonth is skewed right (γ1 = 0.2179)

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 3
Median 6
Q3 7
95-th Percentile 8
Maximum 20
Range 20
IQR 4

Descriptive Statistics

Mean 5.319
Standard Deviation 2.4254
Variance 5.8824
Sum 11787
Skewness 0.2179
Kurtosis 1.8457
Coefficient of Variation 0.456
  • numwebvisitsmonth is not normally distributed (p-value 5.0196470844331725e-08)
  • numwebvisitsmonth has 8 outliers

acceptedcmp3

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 142.8 KB
  • The largest value (0) is over 12.6 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2216
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 12.6 times larger than the second largest value (1)
  • acceptedcmp3 has words of constant length

acceptedcmp4

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 142.8 KB
  • The largest value (0) is over 12.51 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2216
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 12.51 times larger than the second largest value (1)
  • acceptedcmp4 has words of constant length

acceptedcmp5

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 142.8 KB
  • The largest value (0) is over 12.68 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2216
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 12.68 times larger than the second largest value (1)
  • acceptedcmp5 has words of constant length

acceptedcmp1

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 142.8 KB
  • The largest value (0) is over 14.61 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2216
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 14.61 times larger than the second largest value (1)
  • acceptedcmp1 has words of constant length

acceptedcmp2

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 142.8 KB
  • The largest value (0) is over 72.87 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2216
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 72.87 times larger than the second largest value (1)
  • acceptedcmp2 has words of constant length

complain

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 142.8 KB
  • The largest value (0) is over 104.52 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2216
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 104.52 times larger than the second largest value (1)
  • complain has words of constant length

response

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 142.8 KB
  • The largest value (0) is over 5.65 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2216
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 5.65 times larger than the second largest value (1)
  • response has words of constant length

Interactions

Correlations

Missing Values